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FAST COMPUTATION OF OVERFLOW FLAG IN A BIT MANIPULATION UNIT 

BACKGROUND OF THE INVENTION 

Field of the Invention 

5 The present invention relates to circuits for digital signal processing systems, and, more 

particularly, detecting the overflow flag of a barrel shift operation. 

Description of the Related Art 

Bit manipulation units (BMUs) are circuits commonly employed in digital signal processors 
1 0 (DSPs), micro-controllers, or other types of processors to scale and format data. Typical instructions 
executed by a BMU include: (a) arithmetic shift left or right, (b) logical shift right, and (c) normalize. 
Pseudo-code equations (1) through (4) below represent these operations: 

aD0=aS0»aSl /^arithmetic shift right */ (1) 

aD0=aS0«aSl /^arithmetic shift left */ (2) 

1 5 aD0=aS0»>aSl /^logical shift right */ (3) 

aD0=norm(aS0,aDl) /^normalize */ (4) 

where aDO and aDl represent resulting output data written to corresponding destination accumulators, 
and aSO and aSl represent input data stored in corresponding source accumulators. For an arithmetic 
shift right operation, redundant sign bits are shifted in from the left, while for a logic shift right 
20 operation, padding bits (logic "0" or logic "1") are shifted in from the left. For an arithmetic shift left 
operation, padding bits are inserted from the right. For a BMU, an operation for logical shift left is 
generally not defined since a logical shift left operation generates the same numerical result as an 
arithmetic shift left operation. 

A BMU executes one operation in one clock cycle. In the DSP 16000 processor available from 
25 Lucent Technologies of Murray Hill, NJ, for example, the source and destination accumulators are 40- 
bits wide; each accumulator may contain up to 32 bits of data and eight guard bits for overflow, and have 
a maximum shift amount of 31 bits. FIGs. 1(a), 1(b), and 1(c) illustrate changing bit positions for the 
three shift operations of equations (1), (2), and (3), respectively, in the exemplary prior art processor. 
For the normalize operation of equation (4), the BMU shifts the bits of the 40-bit input data an 
30 appropriate shift amount (number of bit positions) to represent the value with a 32-bit result and an 
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exponent. 

FIG. 2 shows logic circuit 200 for an exemplary implementation of a prior art BMU 201 and 
external accumulator 202. The circuit 200 has three primary input values: (a) the data aS0 t which is a 
sequence of bit values, (b) the shift value, which is the direction (indicated by the sign bit of the shift 
5 value) and number of bit positions by which the data is to be shifted corresponding to the shift amount, 
and (c) the saturation mode value SAT The saturation mode value SAT identifies whether the output 
values are to be limited to a 32-bit sequence. The data aSO is provided to BMU 201 from, for example, 
40-bit wide first source accumulator included in external accumulator 202. The shift value may be the 
shift value aSl that is provided to BMU 201 from the upper half of a second source accumulator, that 

1 0 may be included in external accumulator 202, or the shift value may be an immediate value from the 

instruction itself. The shift value aSl may be limited to 6 bits by limiter 210. The saturation mode value 
SAT is provided from an external control register (not shown) that is set by an instruction. If the 
saturation mode value SAT is enabled, as described subsequently, then a result generated by BMU 201 
may be limited to a length of 32 bits. In addition, the input value SHIFT identifies whether an 

1 5 arithmetic/logic shift operation or a normalize operation is to be performed. 

The circuit 200 has three primary output values: (a) two 40-bit results aDO and aDl written to 
external accumulator 202 (or other register file), and (b) an overflow flag LMV that is set and written to a 
status register (not shown). The input value SHIFT applied to multiplexer (mux) 211 selects either 1) the 
shift value from limiter 210 for an arithmetic shift left or right operation or 2) the shift amount for a 
20 normalize operation that is generated by exponent/compare module 205. Barrel shifter 203 performs a 
shift, either left or right, of the input data aSO based on the decoded input SV from mux 211. Barrel 
shifter 203 may include a decoder to translate the value SV from mux 211 into 1) the number of bit 
positions to shift the input data aSO and 2) whether to shift the input data aSO left or right. 

The result aDO is the output of a barrel shift operation applied to the input sequence aSO by barrel 
25 shifter 203, where the sequence length of the result may be limited by saturate module 204. When the 
overflow flag LMV is not set, the result aDO may be represented with 32-bits, and so limiting the result 
from barrel shifter 203 with saturate module 204 is not necessary (SAT is in a "don't care" state). 

When the overflow flag LMV is set, the result aDO may not be represented with 32 bits (i.e., an 
overflow occurs). Whether a 32-bit or 40-bit result aDO is written to the external accumulator is 
30 determined by the output signal of AND gate 208 applied to saturate module 204. AND gate 208 

generates as an output signal the logical AND of 1) the overflow flag LMV and 2) the saturation mode 
value SAT If saturation mode value SAT is enabled, then the output of AND gate 208 causes saturate 
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module 204 to limit the result aDO to the range 0xff80000000 to 0x007fffffff (Hex). If saturation mode 
value SAT is disabled, then the output of AND gate 208 causes saturate module 204 to pass the entire 40- 
bit result aDO to the external accumulator 202. 

For the normalize operation, the BMU 201 shifts the input bits by an appropriate shift amount to 
5 represent the value with 32-bits. The shift amount is referred to as the "exponent" and the result aDl is 
the exponent for the normalize operation. The (e.g., 6-bit) exponent is generated by exponent/compare 
module 205 based on the input data aSO, and exponent/compare module 205 also generates the logical 
overflow value LLV component of the overflow flag LMV (described below). 

The overflow flag LMV is a value comprising two components. The first component is the 
1 0 logical overflow value LL V determined by exponent/compare module 205. Exponent/compare module 
205 compares the 6-bit value SV from the mux 211 to the 6-bit exponent generated for the input data 
aSO. The exponent also indicates the number of redundant sign bits in the input data aSO. If the shift 
value is greater than the number of redundant sign bits, a logical overflow has occurred, and 
exponent/compare module 205 sets the logical overflow value LLV to one. For the described exemplary 
1 5 implementation of the prior art, the logical overflow value LLV is only set to one on left shifts and, by 
definition, is set to zero on right shifts. The second component of the LMV, sometimes referred to as the 
math overflow value, is determined after the input has been shifted. A test determines if the bits in the 
positions corresponding to the guard bits and the most significant bit of the input data aSO, such as bits 
39 to 3 1 in the exemplary embodiment, and the result aDO are not all the same. Compare module 212 
20 applies the test to the output aDO prior to application to saturate block 204, and the overflow flag LMV is 
set when the test determines the bits are not the same. As shown in FIG. 2, the overflow flag LMV is 
generated by OR gate 209 as a logic OR of 1) the logical overflow value LL V and 2) the math overflow 
value of the compare module 212. In some implementations, the logical overflow value LLV is a 
separate flag and the LMV is only set if all of the guard bits are not the same as bit 3 1 . 



25 



SUMMARY OF THE INVENTION 



The present invention relates to faster computation of the overflow flag LMV in a bit 
manipulation unit (BMU). For fast computation, the overflow flag is computed based on the input data 
and the shift amount. The overflow flag is calculated separately as either a LMV\ e f t for an arithmetic 
30 shift left operation or LMV r igfo for an arithmetic shift right operation. Calculation of LMVfeft and 
LMVyjgfa for an exemplary embodiment may be as follows. For an arithmetic shift left operation, 
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LMVi e ji may be computed by first adding one plus the number of guard bits in the input data to the shift 
amount, and then detecting the number of redundant sign bits. Hence, for an arithmetic shift right 
operation, LMVyigfa may be computed by combining the total number of redundant sign bits in the input 
data with the shift amount since a shift right essentially adds more sign bits. By computing the overflow 
5 flag separately as LMV\ e fi and LMV r igfo t for arithmetic left and right shifts, respectively, the overflow 
flag LMVis determined in parallel with the barrel shift operation and so does not depend on the result 
from the barrel shift operation. Since calculation of the overflow flag is part of the critical path of 
computation in, for example, a BMU, an advantage of the present invention may be a relative reduction 
in the time necessary for a BMU to calculate the overflow flag. 

10 

BRIEF DESCRIPTION OF THE DRAWINGS 

Other aspects, features, and advantages of the present invention will become more fully apparent 
from the following detailed description, the appended claims, and the accompanying drawings in which: 

FIG. 1(a) illustrates changing bit positions for an arithmetic shift right operation of a bit 
1 5 manipulation unit (BMU) in a prior art processor; 

FIG. 1(b) illustrates changing bit positions for an arithmetic shift left operation of a bit 
manipulation unit (BMU) in a prior art processor; 

FIG. 1(c) illustrates changing bit positions for a logical shift right operation of a bit manipulation 
unit (BMU) in a prior art processor; 

20 FIG. 2 shows logic circuit for an exemplary implementation of a prior art bit BMU and external 

accumulator; 

FIG. 3 shows a logic circuit for an exemplary implementation of a BMU in accordance with the 
present invention; 

FIG. 4(a) shows the normalized, relative circuit delay for steps generating the value of LMV for 
25 the prior art circuit of FIG. 2; 

FIG. 4(b) shows the normalized, relative circuit delay for steps generating the value of LMV r jgh t 
for the arithmetic shift right operation for the circuit of FIG. 3; 

FIG. 4(c) shows the normalized, relative circuit delay for steps generating the value of LMVfeft 
for the arithmetic shift left operation for the circuit of FIG. 3. 
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DETAILED DESCRIPTION 

In accordance with exemplary embodiments of the present invention, a bit manipulation unit 
(BMU) generates the overflow flag for an operation using fast computation of the overflow flag in shift 
5 left and shift right operations, rather than for math and logical overflow values. The overflow value is 
computed only for arithmetic shift left and right operations, since for other BMU operations, the 
overflow flag is set to zero. 

FIG. 3 shows a circuit 300 for an exemplary implementation of a BMU 301 and external 
accumulator 302 operating in accordance with an embodiment of the present invention. Inputs to circuit 

10 300 include (a) the data aSO, which is a sequence of bit values, (b) the shift value aSl, which is the 
number of bit positions by which the data is to be shifted, and (c) the saturation mode value SAT The 
data aSO is provided to BMU 301 from, for example, a 40-bit wide first source accumulator included in 
external accumulator 302. The shift value aSl may be limited to 6-bits by limiter 310. The saturation 
mode value SAT is provided from an external control register (not shown) that is set by the processor. If 

1 5 the saturation mode value SAT is enabled, then the result generated by BMU 301 may be limited to a 
length of 32 bits. In addition, external input signals SHIFT and SHIFT LEFT may be generated by a 
controller (not shown in FIG. 3). The input value SHIFT identifies whether an arithmetic/logical shift 
(left or right) operation or a normalize operation is to be performed, and the signal SHIFT LEFT 
identifies whether the operation is an arithmetic shift left (SHIFT LEFT set) or an arithmetic shift right 

20 (SHIFT LEFT not set). 

Circuit 300 generates: (a) two 40-bit results aDO and aDl written to external accumulator 302 (or 
other register file), and (b) an overflow flag LMV generated in accordance with the embodiment of the 
present invention described subsequently. The input value SHIFT applied to multiplexer (mux) 311 
selects either 1) the shift value from limiter 310 for an arithmetic shift left or right operation or 2) the 
25 shift of the normalize operation generated by exponent/compare module 305. Barrel shifter 303 

performs a shift, either left or right, of the input data aSO based on the decoded input SV from mux 311. 
Barrel shifter 303 may include a decoder to translate the value SV from mux 311 into 1) the number of 
bit positions to shift the input data aSO and 2) whether to shift the input data aSO left or right. 

The result aDO is the result of a barrel shift operation by barrel shifter 303, where the length of 
30 the result may be limited, if enabled, by saturate module 304. When the overflow flag LMV is not set, 
the barrel shifted result may be represented in 32 bits and limiting by saturate module 304 is not 
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necessary (SAT is in a "don't care" state). 

When the overflow flag LMV is set, the barrel shifted result may not necessarily be represented in 
32 bits, (i.e., an overflow occurs). Whether a 32-bit or 40-bit result aDO is written to the external 
accumulator is determined by logic AND gate 308 in response to the logic AND of the overflow flag 
5 LMV and the saturation mode value SAT If saturation mode value SAT is enabled, then the output of 
AND gate 308 causes saturate module 304 to limit the result aDO to the range OxffBOOOOOOO to 
0x007fffffff (Hex). If saturation mode value SAT is disabled, then the output of AND gate 308 causes 
saturate module 304 passes the entire 40-bit result aDO to the external accumulator 302. 

For the normalize operation, the BMU 301 shifts the input bits an appropriate shift amount to 
1 0 represent the value with 32 bits, where the result aDl is the exponent for the normalize operation. The 
(e.g., 6-bit) exponent is generated by exponent/compare module 305 based on the input data aSO and the 
output from mux 323. 

Generation of the overflow flag LMV by BMU 301 is now described. In accordance with the 
exemplary implementation shown in FIG. 3, two separate quantities are generated: LMViEFT 
1 5 representing an overflow value for an arithmetic shift left operation and LMVjuqjjt representing an 

overflow value for an arithmetic shift right operation. The two quantities LMV r LEFT LMVrjqht are 
provided to multiplexer (mux) 325 that selects either LMV^EFT or LMV RIGHT as tlie output overflow 
flag LMV based on the signal SHIFT LEFT. 

For an arithmetic shift left operation, the overflow flag LMV of BMU 301 may be set two ways: 
20 1) if the shift amount is greater than the number of redundant sign bits, or 2) if the bits corresponding to 
the guard bits and the most significant bit (MSB) of the barrel shift operation result (e.g., bits 39 to 31 of 
the result aDO of the exemplary implementation) are not all the same. An equivalent method of 
computing the overflow flag LMV for an arithmetic left shift operation is to first add one plus the number 
of guard bits (e.g., nine for the exemplary implementation described subsequently) to the shift amount 
25 aSl 9 and then detect the number of redundant sign bits. This method indicates if a redundant sign bit has 
been shifted off or if the "next" (e.g., nine) bits of the input aSO are shifted into the guard positions of the 
result aDO. This component of the overflow flag for an arithmetic shift left operation is referred to 
herein as LMV left, and LMV left may be generated with a comparison in exponent/compare module 
305 of BMU 301 by implementing the following pseudo-code: 

30 1 00 if (number of redundant sign bits > (9 + shift amount)) { 

101 LMV left = 0; 
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102 } else { 

103 LMV LE FT= 1; 

104 } 

For the exemplary implementation of FIG. 3, exponent/compare module 305 computes LMVleFT using 
5 the output value of mux 323. 6-bit adder/saturator 324 adds nine to the shift amount provided by mux 
311 based on the signal SHIFT. Mux 323 is employed by BMU 301 to select, based on the signal SHIFT 
LEFT, either 1) the shift amount SV from mux 311 for either an arithmetic shift right, logic shift right, or 
normalize operations; or 2) the output value of 6-bit adder/saturator 324 for an arithmetic shift left 
operation 

10 For an arithmetic shift right operation, BMU 301 needs to determine if a set of most significant 

bits (the quantity of one plus the number of guard bits, and, e.g., the most significant bits 39 to 31 of the 
exemplary implementation) in the output result of barrel shifter 303 are not all the same. The total 
number of redundant sign bits in the result after the barrel shift operation of barrel shifter 303 is the 
number of redundant sign bits in the input aSO plus the shift amount since a shift right adds sign bits. If 

1 5 the number of sign bits in the input is added to the shift amount and the total is less than one plus the 
number of guard bits (e.g., nine bits for the exemplary implementation), then an overflow occurs for the 
operation. If an overflow occurs, the result for this case sets the overflow flag LMVrjqhtXo one; 
otherwise, LMVrjqht gets set to zero. The value for LMVjuQfjjr may be generated by the following 
pseudo-code: 



20 110 if (number of redundant sign bits + shift amount) > 9 { 

111 LMVjuGHT=0l 

112 }else{ 

113 LAfV RIGHT =l; 

114 } 



25 Circuit 300 generates LMVrjqhj as follows. The guard bits plus the MSB of the input data aSO 

from external accumulator 302 (i.e., aS0[39:31J) are applied to exponent module 320 to compute the 4- 
bit value for the exponent. Sign-extend module 321 prepends two bits representing the sign to the 4-bit 
value to generate a 6-bit value for the exponent. The 6-bit value for the exponent represents the number 
of redundant sign bits in the input data aSO. The 6-bit value for the exponent generated by sign-extend 

30 module 321 is then added to the shift amount provided from mux 323 by add, saturate, and compare 

IDS# 12 1954 (990.0316) 7 Case Name AUdina 1 1-7 



(ADD/SAT/compare) module 322. ADD/SAT/compare module 322 compares the number of total 
redundant sign bits to the total number of guard bits (e.g., 9-bits) to determine whether to set LMVrjghT 
as zero or one. The output of ADD/SAT/compare module 322 is provided to mux 325. Mux 325 selects 
either LMVlefT or LMV RIGHT as the overflow flag LMV output by BMU 301 to a status register (not 
5 shown) based on the signal SHIFT LEFT. 

A BMU operating in accordance with an exemplary embodiment of the present invention may 
provide the following advantages. FIG. 4(a) shows the normalized, relative circuit delay for steps 
generating the value of LMV for the prior art circuit 200 of FIG. 2. FIGs. 4(b) and 4(c) show the 
normalized, relative circuit delays for steps generating the value of LMV for the arithmetic shift right and 

10 arithmetic shift left operations for the circuit 300 of FIG. 3. Comparing FIG. 4a with FIGs. 4b and 4c, 
the circuit 300 exhibits at least a 25-percent improvement in speed over the prior art circuit 200 of FIG. 
2. Furthermore, since a BMU only performs one operation per clock cycle, the additional logic added in 
an implementation may be shared by other processor functions, and thus a BMU may be optimized for 
area occupied in an integrated circuit. For instance, the 6-bit adder/saturator 324 of FIG. 3 may be 

1 5 shared and its inputs multiplexed depending on shift right or shift left operations. 

While the present invention is described for a BMU having 40-bit wide external accumulators 
and allowing for 32-bit representation of results, 8 guard bits, and maximum shift amounts of 31, the 
characteristics of the described implementation are exemplary only. One skilled in the art would 
recognize that the present invention is not so limited and the techniques described herein may be 
20 extended to other lengths and types of input data, for any size accumulator, to different numbers of guard 
bits, and to different shift amounts. The present invention may also be extended to other types of 
operations that may be executed by a BMU. 

While the exemplary embodiments of the present invention have been described with respect to 
methods or processes of circuits, the present invention is not so limited. As would be apparent to one 
25 skilled in the art, various functions may also be implemented in the digital domain as processing steps in 
a software program. Such software may be employed in, for example, a digital signal processor, micro- 
controller or general purpose computer. 

The present invention can be embodied in the form of methods and apparatuses for practicing 
those methods. The present invention can also be embodied in the form of program code embodied in 
30 tangible media, such as floppy diskettes, CD-ROMs, hard drives, or any other machine-readable storage 
medium, wherein, when the program code is loaded into and executed by a machine, such as a computer, 
the machine becomes an apparatus for practicing the invention. The present invention can also be 
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embodied in the form of program code, for example, whether stored in a storage medium, loaded into 
and/or executed by a machine, or transmitted over some transmission medium, such as over electrical 
wiring or cabling, through fiber optics, or via electromagnetic radiation, wherein, when the program code 
is loaded into and executed by a machine, such as a computer, the machine becomes an apparatus for 
5 practicing the invention. When implemented on a general-purpose processor, the program code 

segments combine with the processor to provide a unique device that operates analogously to specific 
logic circuits. 

It will be further understood that various changes in the details, materials, and arrangements of 
the parts which have been described and illustrated in order to explain the nature of this invention may be 
1 0 made by those skilled in the art without departing from the principle and scope of the invention as 
expressed in the following claims. 
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What is claimed is: 



CLAIMS 



1 1 . An apparatus for setting an overflow flag based on a shift value and input data for one of 

2 a plurality of operations executed in a bit manipulation unit (BMU), the apparatus comprising: 

3 a first logic circuit, for a first subset of the plurality of operations, setting a first overflow value 

4 based on the left shift value and the input data; 

5 a second logic circuit, for a second subset of the plurality of operations, setting a second 

6 overflow value based on the right shift value and the input data; and 

7 a selector providing either the first or the second overflow value as the overflow flag based on a 

8 signal identifying the one of the plurality of operations executed by the BMU. 

1 2. The invention as recited in claim 1, wherein the first logic circuit sets the first overflow 

2 value by: 

3 1) comparing a number of redundant sign bits to a combination of the number of most significant 

4 bits (MSBs) of the input data and the shift value, and 

5 2) setting the first overflow value based on the comparison, wherein 

6 the most significant bits correspond to bits at positions for a set of guard bits in the input data 

7 and at least one bit of the input data. 

1 3. The invention as recited in claim 2, wherein the first logic circuit comprises: 

2 a saturator forming the combination; and 

3 an exponent and compare module comparing the number of redundant sign bits based on the 

4 combination. 

1 4. The invention as recited in claim 1, wherein the second logic circuit sets the second 

2 overflow value by: 

3 1) comparing the number of redundant sign bits to a combination of the most significant bits of 

4 the input data and the shift value, and 

5 2) setting a second overflow value based on the comparison, wherein 
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6 the most significant bits correspond to bits at positions for a set of guard bits in the input data 

7 and at least one bit of the input data. 

1 5. The invention as recited in claim 4, wherein the second logic circuit comprises: 

2 an exponent module generating an exponent for the input data, the exponent module including 

3 means for prepending a sign to the exponent to generate a value corresponding to the number of 

4 redundant sign bits; 

5 a combiner adding the shift amount to the value corresponding to the number of redundant sign 

6 bits to form a combined value; 

7 a comparator generating the comparison as the difference between the combined value and the 

8 number of most significant bits of the input data. 

1 6. The invention as recited in claim 1, wherein the apparatus is a circuit, and wherein: 

2 the first logic circuit comprises: 

3 a saturator forming a combination of the number of most significant bits (MSBs) of the input 

4 data and the shift value, the most significant bits corresponding to bits at positions for a set of guard 

5 bits in the input data and at least one bit of the input data; and 

6 an exponent and compare module comparing a number of redundant sign bits based on the 

7 combination, exponent and compare module setting the first overflow value based on the 

8 comparison; and 

9 the second logic circuit comprises: 

1 0 an exponent module generating an exponent for the input data, the exponent module including 

1 1 means for prepending a sign to the exponent to generate a value corresponding to a number of redundant 

1 2 sign bits; 

1 3 a combiner adding the shift amount to the value corresponding to the number of redundant sign 

14 bits to form a combined value; 

15 a comparator generating the comparison as the difference between the combined value and the 

16 most significant bits of the input data, wherein the second overflow value is set based on the comparison. 

1 7. The invention as recited in claim 1 , wherein the apparatus is embodied as a circuit. 
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1 8. The invention as recited in claim 7, wherein the circuit is embodied in an integrated 

2 circuit. 

1 9. The invention as recited in claim 1, wherein the apparatus is embodied in either a digital 

2 signal processor, microprocessor, micro-controller, or application-specific integrated circuit. 

1 1 0. A method of setting an overflow flag based on a shift value and barrel-shifted input data 

2 for one of a plurality of operations executed in a bit manipulation unit (BMU), the method comprising 

3 the steps of: 

4 (a) for a first subset of the plurality of operations, setting a first overflow value based on the left 

5 shift value and the input data; 

6 (b) for a second subset of the plurality of operations, setting a first overflow value based on the 

7 right shift value and the input data; and 

8 (c) selecting either the first or the second overflow value as the overflow flag based on a signal 

9 that identifies the one of the plurality of operations executed by the BMU. 

1 11. The invention as recited in claim 10, wherein step (a) comprises the steps of: 

2 (al) comparing a number of redundant sign bits to a combination of the number of most 

3 significant bits of the input data and the shift value, wherein the most significant bits correspond 

4 to bits at positions for a set of guard bits in the input data and at least one bit of the input data, 

5 and 

6 (a2) setting the first overflow value based on the comparison;. 

1 12. The invention as recited in claim 10, wherein step (b) comprises the steps of: 

2 (bl) comparing a number of redundant sign bits to a difference between most significant 

3 bits of the barrel-shifted input data and the shift value, wherein the most significant bits 

4 correspond to bits at positions for a set of guard bits in the input data and at least one bit of the 

5 input data, and 

6 (b2) setting a second overflow value based on the comparison. 

1 13. The invention as recited in claim 10, wherein the method is embodied in a processor in 

2 an integrated circuit. 

1 14, The invention as recited in claim 10, wherein the method is embodied in a processor of 

2 either a digital signal processor, microprocessor, micro-controller, or application-specific integrated 
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3 circuit. 

1 1 5 . A computer-readable medium having stored thereon a plurality of instructions, the 

2 plurality of instructions including instructions which, when executed by a processor, cause the processor 

3 to implement a method for setting a overflow flag based on a shift value and barrel-shifted input data for 

4 one of a plurality of operations executed in a bit manipulation unit (BMU), the method comprising the 

5 steps of: 

6 (a) for a first subset of the plurality of operations, setting a first overflow value based on the left 

7 shift value and the input data; 

8 (b) for a second subset of the plurality of operations, setting a second overflow value based on 

9 the right shift value and the input data; and 

1 0 (c) selecting either the first or the second overflow value as the overflow flag based on a signal 

1 1 that identifies the one of the plurality of operations executed by the BMU. 
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FAST COMPUTATION OF OVERFLOW FLAG IN A BIT MANIPULATION UNIT 

ABSTRACT OF THE DISCLOSURE 
A bit manipulation unit (BMU) scales and formats data and includes fast computation of the 
5 overflow flag. For fast computation the BMU's overflow flag is computed based on the input data and 
the shift amount. The overflow flag is calculated separately as either a LMVi e ft for an arithmetic shift 
left operation or LMV r ight for an arithmetic shift right operation. For an arithmetic shift left operation, 
LMVfefi may be computed by first adding one plus the number of guard bits in the input data to the shift 
amount, and then detecting the number of redundant sign bits. For an arithmetic shift right operation, 
10 LMV r fgfo t may be computed by checking the input redundant sign bits plus the right shift amount. By 
computing the overflow flag separately as LMV{ e f t and LMV r igfa for arithmetic left and right shifts, 
respectively, the overflow flag LMVis determined in parallel with the barrel shift operation and so does 
not depend on the result from the barrel shift operation. Consequently, an advantage of employing this 
technique in a BMU may be a relative reduction in the time necessary for a BMU to calculate the 
15 overflow flag. 
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IN THE UNITED STATES 
PATENT AND TRADEMARK OFFICE 

Declaration and Power of Attorney 
As a below named inventor, I hereby declare that: 

My residence, post office address, and citizenship are as stated below next to my name. 

I believe I am the first, original, and sole inventor (if only one name is listed below) or a 
first, original, and joint inventor (if multiple names are listed below) of the subject matter which is 
claimed and for which a patent is sought on the invention entitled Fast Computation Of Overflow 
Flag In A Bit Manipulation Unit, the specification of which is being filed under the above- 
identified Attorney Docket Number. 

I hereby state that I have reviewed and understand the contents of the above identified 
specification, including the claims, as amended by an amendment, if any, specifically referred to in 
this oath or declaration. 

I acknowledge the duty to disclose all information known to me which is material to 
patentability as defined in Title 37, Code of Federal Regulations, 1.56. 

I hereby claim foreign priority benefits under Title 35, United States Code, 119 of any 
foreign application(s) for patent or inventor's certificate listed below and have also identified below 
any foreign application for patent or inventor's certificate having a filing date before that of the 
application on which priority is claimed: 

None 

I hereby claim the benefit under Title 35, United States Code, 1 19(e) of any United States 
provisional application(s) identified below: 

None 

I hereby claim the benefit under Title 35, United States Code, 120 of any United States 
application(s) listed below and, insofar as the subject matter of each of the claims of this application 
is not disclosed in the prior United States application in the manner provided by the first paragraph 
of Title 35, United States Code, 1 12, 1 acknowledge the duty to disclose all information known to 
me to be material to patentability as defined in Title 37, Code of Federal Regulations, 1.56 which 
became available between the filing date of the prior application and the national or PCT 
international filing date of this application: 

None 
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I hereby declare that all statements made herein of my own knowledge are true and that all 
statements made on information and belief are believed to be true; and further that these statements 
were made with the knowledge that willful false statements and the like so made are punishable by 
fine or imprisonment, or both, under Section 1001 of Title 18 of the United States Code and that 
such willful false statements may jeopardize the validity of the application or any patent issued 
thereon. 

I hereby appoint the following attorney(s) with full power of substitution and revocation, to 
prosecute said application, to make alterations and amendments therein, to receive the patent, and 
to transact all business in the Patent and Trademark Office connected therewith: 



Thomas J. Beam 
Lester H. Birnbaum 
Richard J. Botos 
Jeffery J. Brosemer 
Kenneth M. Brown 
Donald P. Dinella 
Guy Eriksen 
Martin I. Finston 
William S. Francos 
Barry H. Freedman 
Julio A. Garceran 
Jimmy Goo 
Anthony Grillo 
Stephen M. Gurey 
John M. Harman 
Matthew J. Hodulik 
Michael B. Johannesen 
Mark A. Kurisko 
Irena Lager 
John B. Maclntyre 
Christopher N. Malvone 
Scott W. McLellan 
Martin G. Meder 
John C. Moran 
Michael A. Morra 
Gregory J. Murgia 



(Reg. No. 44528) 
(Reg. No. 25830) 
(Reg. No. 32016) 
(Reg. No. 36096) 
(Reg. No. 37590) 
(Reg. No. 39961) 
(Reg. No. 41736) 
(Reg. No. 31613) 
(Reg. No. 38456) 
(Reg. No. 26166) 
(Reg. No. 37138) 
(Reg. No. 36528) 
(Reg. No. 36535) 
(Reg. No. 27336) 
(Reg. No. 38173) 
(Reg. No. 36164) 
(Reg. No. 35557) 
(Reg. No. 38944) 
(Reg. No. 39260) 
(Reg. No. 41170) 
(Reg. No. 34866) 
(Reg. No. 30776) 
(Reg. No. 34674) 
(Reg. No. 30782) 
(Reg. No. 28975) 
(Reg. No. 41209) 



990.0316 



Customer No.: 22186 



-3- 



Attorney Docket No.: Alidina 11-7 



Claude R. Narcisse (Reg. No. 38979) 

Joseph J. Opalach (Reg. No. 36229) 

Neil R. Ormos (Reg. No. 35309) 

Eugen E. Pacher (Reg. No. 29964) 

Jack R. Penrod (Reg. No. 3 1 864) 

Gregory C. Ranieri (Reg. No. 29695) 

Scott J. Rittman (Reg. No. 390 1 0) 

Ferdinand M. Romano (Reg. No. 32752) 

Eugene J. Rosenthal (Reg. No. 3 665 8) 

Bruce S. Schneider (Reg. No. 27949) 

Ronald D. Slusky (Reg. No. 26585) 

David L. Smith (Reg. No. 30592) 

Ozer M. N. Teitelbaum (Reg. No. 36698) 

John P. Veschi (Reg. No. 39058) 

David Volejnicek (Reg. No. 29355) 

Charles L. Warren (Reg. No. 27407) 

Jeffrey M. Weinick (Reg. No. 36304) 

Eli Weiss (Reg. No. 17765) 



I hereby appoint the attorney(s) on ATTACHMENT A as associate attorney(s) in the 
aforementioned application, with full power solely to prosecute said application, to make alterations 
and amendments therein, to receive the patent, and to transact all business in the Patent and 
Trademark Office connected with the prosecution of said application. No other powers are granted 
to such associate attorney(s) and such associate attorney(s) are specifically denied any power of 
substitution or revocation. 



Full name of 1 st inventor: Mazhar M. Alidina 



Inventor's signature. 



Date 



Residence: Allentown. Lehigh County, Pennsylvania 
Citizenship: USA 



Post Office Address: 



1290 Ascot Circle 



Allentown. Pennsylvania 18103 
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Full name of 2 nd inventor: Alexander Goldovskv 

Inventor's signature. Dat e ^ Z ~ Z &s^ 

Residence: Philadelphia. Philadelphia County, Pennsylvania 
Citizenship: Russia 

Post Office Address: 9346 A Jamison Street 

Philadelphia. Pennsylvania 19115 
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ATTACHMENT A 

Attorney Name(s): Steve Mendelsohn Reg. No.: 35.951 

Ian M. Hushes 41.083 

Telephone calls should be made to Mendelsohn & Associates, P.C. at: 
Phone No.: 215-557-6657 
Fax No.: 215-557-8477 



All written communications are to be addressed to: 

Steve Mendelsohn 
Mendelsohn & Associates, P.C. 
1515 Market Street, Suite 715 
Philadelphia, PA 19102 

Customer No.: 22186 
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